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Amendments to the claims : 

This listing of claims replaces all prior versions and listing of claims 
in the application: 

Listing of Claims : 

1. (Currently Amended) A method for administration and 
replication of a database, comprising the stops of ; 

providing a database rhanagement system with a built-in random 
sampling facility configured as an integral part of said database management 
system, whereby the random sampling facility has access to low level functions 
and bufFers of the database management system; andr 

executing said random sampling facility from within the database 
management to perform a replication operation on said database; 

defining a database record sample size S: 

randomly sampling S records of the database using said random 
sampling facility: 

storing statistics for each of said S records, wherein said statistics 
include a record key for each record: and. 

producing an extrapolated replication oartition analysis based on 
said statistics, wherein said replication operation is based on said extrapolated 
replication partition analysis . 

2. (Canceled) 

3. (Currently Amended) The method as set forth in claim [[2]] 1, 
wherein tho step of said defining said sample size S includes: 

defining a default sample size; 

selectively receiving a desired sample size; and, 

setting said sample size S as said default sample size when the 
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desired sample size is not selectively received, and setting said sample size S as 
said desired sample size when the desired sample size is selectively received. 

4. (Currently Amended) The method as set forth in claim 1 f 
further comprising the stops of : 

defin i ng a database record sample oiz o S; 

randomly samp l ing S records of the database uoing said random 
sa mpl i ng facil i ty ; 

storing statistics for each of pa i d S records, whoro i n sold statistics 
i nclude q record key for each record; and, 

producing a partial replication partition analysis based on said 

statistics. 

5. (Currently Amended) The method as set forth in claim 4, 
wherein tho stop of said defining said sample size S includes: 

defining a default sample size; 
selectively receiving a desired sample size; and, 
setting said sample size S as said default sample size when the 
desired sample size is not selectively received, and setting said sample size S as 
said desired sample size when the desired sample size is selectively received. 

6. (Currently Amended) A method for database administration 
and replication, comprising tho stops of : 

providing a database management system with an integrated a 
random sampling facility as an integral part , of the database management 
system, the random sampling facility having access to low level functions and 
buffers; 

selecting a default sample size value S . wherein S is a value of said 
selected default sample size ; 

selectively receiving a desired sample size value D and setting said 
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default sample sire value S to said desired sample size value D when said 
desired sample size value D is received; 

randomly sampling S records of the database using 
said random sampling facility; 

storing statistics for each of said S records, wherein said statistics 
include a record key for each record; and, 

producing at least one of an extrapolated replication partition 
analysis based on said statistics? and a partial replication partition analysis based 
on said statistics; and, 

performing a replication operation on said database. 

7. (Currently Amended) A method for database, administration 

and replication, comprising: 

providing a database management system with an integrated a 
random sampling, facility as an integral part of the database management 
system, the random sampling facility having access to low level functions and 
buffers; 

selecting a default sample size value S . wherein S is a value of said 
selected default sample size ; 

selectively receiving a desired sample size value D and setting said 
default sample size value S to said desired sample size value D when said 
desired sample size value D is received; 

randomly sampling S records of the database using 
said random sampling facility; 

storing statistics for each of said S records, wherein said statistics 
include a record key for each record; and, 

producing at least one of an extrapolated replication partition 
analysis based on said statisticsf and a partial replication partition analysis based 
on said statistics; and, 

performing a replication operation on said database, wherein the 
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selecting said default sample size value D further includes: 

generating a table of S number pairs (Yjjj) ] ~ I, 2 S, wherein 

ail Y and all I are initially set to zero; 

initializing a reservoir of records to an empty state; 
setting an index M to said reservoir equal to zero; 
genera ting a sequence of N non-repeating random numbers Ui,U2, 
. . . , Un, 0<U<l f wherein N is the number of records in the database; and, 

performing additional steps for each random number U k generated, 

k-l,2 N, the additional steps including: 

skipping the next record in the database if 
U k is less than the smallest value of Y in said table of S number pairs; and, 

updating the table of S number pairs if a Y less than U k 
exists by performing further steps including: 

setting M equal to its current value plus one; 
replacing the smallest Y in the table of S number pairs with U k ; 
setting the I value paired with the smallest Y equal to M; and, 

storing all or part of the next record of the database in said 
resen/oir of stored records, wherein the current value of M is a reservoir index to 
said stored record. 

8. (Currently Amended) The method as set forth in claim 7, 
wherein the step of updating the table of S number pairs further includes tho stop 
ef: 

arranging the table in a heap with respect to Y. 

9. (Currently Amended) The method as set forth in daim 6, 

further comprising tho stop of : 

sorting said stored statistics by key prior to producing said at least 
one of said extrapolated replication partition analysis and said partial replication 
partition analysis partition analysis. 
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10. (Currently Amended) The method as set forth in claim 9, 
further comprising tho stops of : 

accessing alt database records in an arbitrary sequence; 
iteratrvely filling all of said partitions except the a last sete partition 
with said accessed records to a maximum byte count; and, 

storing remaining accessed records in the last of said partitions 

partition . 

11. (Currently Amended) The method as set forth in claim 6, 
wherein the stop of said storing statistics includes storing said statistics in a 
memory. 

12. (Currently Amended) The method as set forth in claim 11, 
wherein the stop of said storing statistics includes storing said statistics in said 
memory in a compressed format. 

13. (Currently Amended) The method as set forth in claim 6, 
wherein tho stop of said producing at least one of said partition analyses 
includes the step of defining multiple partition boundaries. 

14. (Currently Amendedd) The method as set forth in claim 6, 
wherein the — step — ef said sampling said S records includes randomly 
sampling the S records utilizing dataspaces including: 

at least one index dataspace; 

at least one key dataspace; and, , 

at least one statistics dataspace. 

15. (Currently Amended) A database management system 
(DBMS) for managing an associated database, the DBMS comprising: 
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random sampling facility configured as part of the database 
management system and having access to low .eve. functions and buffers of the 

database management system; 

first database analysis tools using said integrated random sampling 

facility for generating extrapolated reports on database content; 

second database analysis tools using said integrated random 
sampling facility for generating extrapolated reports on database size; a** 

database replication tools adapted to execute at least one of a 
complete replication having output partition sizes determined by 
random sample of said database, and a partial replication in wh.ch the data 
stored in the partial repiication comprises a random sample of said database: 

n r r^L fi '] , 1 rQfi ntimber S dpfininq 8 d6faUlt s ^ m ^ e -^ e -^ 

a means* w Actively rev iving a particular number defining ^ 

sample size .nH setting said number S equal ,o ^^mm^ 
" T^n« for randomly s ^m piinq S recor d s of the database ^sjoa 

s aid random campling facility; . 

g mflans for Et o n™ cities for each of said S record ^whergin 

^ statistics include s r*mrd Key for each record; and, 
HjrganQ fnr producing at l e ast one of: 

- 1 n_r^r- 1 ^ database conte nt analysis based .. o jL^aid 



statistics; 
and. 



^ & vf rg pn i a ted partition *™' v*is based on said statistics; 
n par ti ?i partition analY -^H t— H ** ifl statistics. 
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16. (Canceled) 

17 . (Previously Presented) A database management system of 
claim [[1 6]] IS, further comprising: 
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a means for sorting said stored statistics by key prtor to producing 
at least one of said analyses. 

18 (Previously Presented) A database management system 
(DBMS) for managing an associated database, the DBMS comprising-. 

random sampling faoMy configured as part of the database 
management system and having eccess to low level functions and buffers of the 

database management system; 

first database analysis tools using said integrated random sampling 
facility for generating extrapolated reports on database content; 

second database analysis tools using said integrated random 
sampling facility for generating extrapolated reports on database size; 

database replication tools adapted to execute at least one of a 
compiete replication having output partition sizes determined by 
random sample of said database, and a part*, replication in wh.ch the data 
stored in the partial replication comprfees a random sample of sa.d database; 
a pre-conf.gured number S defining a default sample s«ze; 
a means for selectively receiving a particular number defining a 
desired sample size and setting said number S equal to said part.cu.ar number; 

a means for randomly sampling S records of the database usmg 

said random sampling facility; 

a means for storing statistics for each of said S records. where,n 

said statistics include a record key for each record; and. 
a means for producing at least one of. 

an extrapolated database content analysis based on said 

an extrapolated partition analysis based on said statistics; 

a partial partition analysis based on said statistics, wherein 
said means for randomly sampling S records further comprises: 
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a means for generating a table of S number pairs (Yplj), j=l,2,... f S, 
wherein all Y and all I are initially zero; 

a means for initializing a reservoir of records to an empty state; 

a means for setting an index M to said reservoir equal to zero; 

a means for generating a sequence of N non-repeating random 
numbers Ui, U2, . . - , U N> 0<U<1, wherein N is the number of records in the 
database; and, 

a means, for each random number U* generated, k=l,2 ( ..-,N; 

comprising: 

a means to skip the next record in said database if U k is less 
than the smallest value of Y in said table of number pairs; and, 

a means to update the table if a Y less than U k exists, 

comprising: 

a means to set M equal to its current value plus one; 

a means to replace the smallest Y in the table with U k ; 

a means to set the I value paired with the smallest Y equal to 

M; and, 

a means to store all or part of the next record of said 
database in said reservoir of stored records, whereirKhe current value of M is a 
reservoir index to said stored record. 

19. (Original) The database management system of claim 18 
wherein the means to update the table further comprises: 

a means to arrange the table in a heap with respect to Y. 

20. (Original) The database management system of claim 18, 
wherein said means for storing statistics comprises a means for storing said 
statistics in memory. 

21. (Original) The database management system of claim 20, 
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further comprising: 

a means for sorting said stored statistics by key prior to producing 
at least one of said analyses, 

22. (Original) The database management system of claim 21, 
wherein said partition analyses include analyses of multiple partition boundaries. 

23. (Original) The database management system of claim 22, 
further comprising: 

a means for accessing all database records in an arbitrary 

sequence; 

a means for iteratively filling all of said partitions except the last with 
said accessed records to a maximum byte count; and, 

a means for storing remaining accessed records in the last of said 

partitions. 

24. (Original) The database management system of claim 16, 
further comprising: 

a means for utilizing at least one index dataspace; 
a means for utilizing at least one key dataspace; and, 
a means for utilizing at least one statistics dataspace. 
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